Pronoun Resolution and Summary Extraction From English Documents
Abstract
This paper presents an approach to generating a precise and meaningful summary of an English-language document. We adopt a modified version of Sidner’s focusing algorithm to perform pronoun resolution. We devise an algorithm that divides compound and complex sentences into simple sentences, and anaphora resolution is then performed on these simple sentences. Finally, we compute the lexical cohesion between pairs of noun phrases whose antecedents have already been found.
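As a rough illustration of the first two steps only, the sketch below splits a sentence at a handful of conjunctions and then resolves pronouns by gender and recency. This is not the paper's method: the recency heuristic is a much simpler stand-in for Sidner's focusing algorithm, the lexical cohesion step is omitted, and every name and lexicon here (`split_clauses`, `resolve_pronouns`, `GENDER`, `PRONOUNS`) is hypothetical.

```python
import re

# Toy lexicons, assumed for illustration; a real system would use a parser and tagger.
PRONOUNS = {"he": "male", "him": "male", "she": "female", "it": "neuter"}
GENDER = {"john": "male", "mary": "female", "book": "neuter"}

# Naive clause boundary: an optional comma followed by one of a few conjunctions.
CLAUSE_BOUNDARY = re.compile(r",?\s*\b(?:and|but|because|while|although)\b\s*")


def split_clauses(sentence):
    """Split a compound or complex sentence into simple clauses (naive)."""
    text = sentence.strip().rstrip(".")
    return [part.strip() for part in CLAUSE_BOUNDARY.split(text) if part.strip()]


def resolve_pronouns(clauses):
    """Replace each known pronoun with the most recent noun of the same gender."""
    last_seen = {}  # gender -> most recently mentioned noun of that gender
    resolved = []
    for clause in clauses:
        out = []
        for token in clause.split():
            key = token.lower()
            if key in GENDER:
                # Remember this noun as a candidate antecedent.
                last_seen[GENDER[key]] = token
                out.append(token)
            elif key in PRONOUNS and PRONOUNS[key] in last_seen:
                # Substitute the pronoun with its assumed antecedent.
                out.append(last_seen[PRONOUNS[key]])
            else:
                out.append(token)
        resolved.append(" ".join(out))
    return resolved


clauses = split_clauses("Mary met John and she thanked him.")
print(clauses)
print(resolve_pronouns(clauses))
```

Splitting on every "and" would also break coordinated noun phrases such as "John and Mary", which is one reason a real implementation would rely on syntactic structure and not on keyword matching.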
Similar resources
Iranian EFL learners’ knowledge of overt pronoun constraint in English
The present study aims to investigate Iranian L2 speakers' knowledge of the overt pronoun constraint (OPC) in English. It also aims to examine L2 learners' understanding of Universal Grammar (Chomsky 1981, 1986, 2000, 2001) and whether it is common to all languages. Specifically, this study takes a new look at the L2 acquisition of knowledge of the overt pronoun constraint (Montalbetti 1984) by Persian ...
Multi-Lingual Coreference Resolution With Syntactic Features
In this paper, we study the impact of a group of features extracted automatically from machine-generated parse trees on coreference resolution. One focus is on designing syntactic features, using binding theory as the guideline, to improve pronoun resolution, although linguistic phenomena such as apposition are also modeled. These features are applied to the Arabic, Chinese and English corefe...
Effect of Pronoun Resolution on Document Similarity
This paper presents a novel effect of Pronoun Resolution on measurement of document similarity. In this paper we have studied the effect of pronoun resolution within the framework of the Vector Space Model and Probabilistic Latent Semantic Analysis. For this purpose we have developed a Benchmark Corpus consisting of documents whose similarity scores have been given by human beings. We measured ...
The Referential Versus Non-referential Use of the Neuter Pronoun in Dutch and English
This paper discusses a corpus-based investigation of the distribution of the third-person neuter singular pronoun in Dutch (“het”). We labeled all pronominal occurrences of “het” in a large corpus of documents. On the basis of the annotated corpora, we developed an automatic classification system using machine learning techniques to distinguish between the different uses of the neuter pronoun. A...
Similarity-based Multilingual Multi-Document Summarization
We present a new approach for summarizing clusters of documents on the same event, some of which are machine translations of foreign-language documents and some of which are English. Our approach to multilingual multi-document summarization uses text similarity to choose sentences from English documents based on the content of the machine translated documents. A manual evaluation shows that 68%...
Publication date: 2009